Candidate Search and Elimination Approach for Telugu OCR

Authors

  • Atul Negi
  • Chandra Kanth Chereddi
Abstract

In this paper we propose an OCR system for Telugu based on the candidate search and elimination technique. The initial candidates for recognition are found by applying a zoning method to the input glyphs. We propose cavities as a structural approach suited specifically to Telugu script, where cavity vectors are used to prune the candidates found by zoning. A final template matching stage using controlled nonlinear normalization concludes the search process. The search terminates as soon as a unique candidate is found at any stage. A recognition accuracy of 97-98% was achieved on real images scanned from Telugu literature.
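
The staged search described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `zone_densities`, `make_zoning_stage`, `make_template_stage`, and `recognize`, the 2x2 zoning grid, the tolerance value, and the toy 4x4 glyphs are all hypothetical. The template stage uses a plain pixel mismatch count rather than the controlled nonlinear normalization of the paper, and the cavity-vector pruning stage is not implemented because the abstract does not define it; it would slot in as another stage between zoning and template matching.

```python
from typing import Callable, Dict, List, Optional

Glyph = List[List[int]]  # binary image: 1 = ink, 0 = background
Stage = Callable[[Glyph, List[str]], List[str]]


def zone_densities(glyph: Glyph, rows: int = 2, cols: int = 2) -> List[float]:
    """Fraction of ink pixels in each cell of a rows x cols grid."""
    h, w = len(glyph), len(glyph[0])
    feats = []
    for r in range(rows):
        for c in range(cols):
            r0, r1 = r * h // rows, (r + 1) * h // rows
            c0, c1 = c * w // cols, (c + 1) * w // cols
            cell = [glyph[y][x] for y in range(r0, r1) for x in range(c0, c1)]
            feats.append(sum(cell) / max(len(cell), 1))
    return feats


def pixel_mismatch(a: Glyph, b: Glyph) -> int:
    """Number of differing pixels between two equal-size glyphs."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def make_zoning_stage(templates: Dict[str, Glyph], tol: float = 0.3) -> Stage:
    """Keep candidates whose zone densities are all within tol of the input's."""
    def stage(glyph: Glyph, candidates: List[str]) -> List[str]:
        target = zone_densities(glyph)
        return [c for c in candidates
                if all(abs(a - b) <= tol
                       for a, b in zip(target, zone_densities(templates[c])))]
    return stage


def make_template_stage(templates: Dict[str, Glyph]) -> Stage:
    """Final stage: keep only the candidate with the fewest mismatched pixels."""
    def stage(glyph: Glyph, candidates: List[str]) -> List[str]:
        return [min(candidates, key=lambda c: pixel_mismatch(glyph, templates[c]))]
    return stage


def recognize(glyph: Glyph, labels: List[str], stages: List[Stage]) -> Optional[str]:
    """Run the stages in order, stopping as soon as one candidate remains."""
    candidates = list(labels)
    for stage in stages:
        survivors = stage(glyph, candidates)
        if survivors:  # assumption: a stage that rejects everything is ignored
            candidates = survivors
        if len(candidates) == 1:
            return candidates[0]  # unique candidate found: end the search early
    return None  # still ambiguous after all stages


# Toy templates (hypothetical shapes, not Telugu glyphs).
TEMPLATES: Dict[str, Glyph] = {
    "bar_left":  [[1, 0, 0, 0], [1, 0, 0, 0], [1, 0, 0, 0], [1, 0, 0, 0]],
    "bar_right": [[0, 0, 0, 1], [0, 0, 0, 1], [0, 0, 0, 1], [0, 0, 0, 1]],
    "box":       [[1, 1, 1, 1], [1, 0, 0, 1], [1, 0, 0, 1], [1, 1, 1, 1]],
}

# A left bar with one stray ink pixel.
noisy = [[1, 0, 0, 0], [1, 0, 0, 0], [1, 1, 0, 0], [1, 0, 0, 0]]

stages = [make_zoning_stage(TEMPLATES), make_template_stage(TEMPLATES)]
result = recognize(noisy, list(TEMPLATES), stages)
```

In this toy run the zoning stage alone leaves a single candidate, so the template stage is never reached, which mirrors the early termination described in the abstract. Ordering the stages from cheapest to most expensive is the design choice that makes such a cascade worthwhile.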

Similar resources

Multi-font Optical Character Recognition System for Printed Telugu Text

The Telugu OCR systems currently available on the market recognize only specific Telugu fonts. This paper describes the development of a multi-font OCR system for printed Telugu characters using artificial neural networks. In this system, classification of the characters is carried out using a multilayer neural network architecture.

Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application

Telugu is a Dravidian language spoken by more than 80 million people worldwide. The optical character recognition (OCR) of the Telugu script has wide ranging applications including education, health-care, administration etc. The beautiful Telugu script however is very different from Germanic scripts like English and German. This makes the use of transfer learning of Germanic OCR solutions to Te...

An Overview of Optical Character Recognition Systems Research on Telugu Language

This paper gives an overview on the development process and ongoing research of the optical character recognition (OCR) systems for Telugu Text. The aim of this paper is to provide a starting point for the researchers entering into this field. In this paper, we present the introduction, characteristics of the Telugu language, developmental process of the OCR systems of Telugu language, research...

Segmentation of Touching Hand written Telugu Characters by using Drop Fall Algorithm

Recognition of Indian language scripts is a challenging problem. Work on the development of complete OCR systems for Indian language scripts is still in its infancy. Complete OCR systems have recently been developed for Devanagari and Bangla scripts. Research in the field of recognition of Telugu script faces major problems mainly related to the touching and overlapping of characters. Segmentation ...

Publication date: 2003